GenAI Poisoning: How Fewer Than 100 Samples Can Corrupt a Multi-Billion Parameter Model
pub.towardsai.net·7h
🛡️AI Safety
Flag this post
Agents Rule of Two: A Practical Approach to AI Agent Security
ai.meta.com·1h·
Discuss: Hacker News
🕳LLM Vulnerabilities
Flag this post
Context Engineering: The Foundation for Reliable AI Agents
thenewstack.io·2h
🪄Prompt Engineering
Flag this post
Too much social media gives AI chatbots ‘brain rot’
nature.com·11h
🏆LLM Benchmarking
Flag this post
AI browsers are here, and they're already being hacked
nbcnews.com·6h·
Discuss: Hacker News
🔓Hacking
Flag this post
AI scrapers request commented scripts
cryptography.dog·6h·
🕳LLM Vulnerabilities
Flag this post
Study: AI Models Trained On Clickbait Slop Result In AI ‘Brain Rot,’ ‘Hostility’
techdirt.com·10h·
Discuss: r/technews
🛡️Content Moderation
Flag this post
Emergent introspective awareness in large language models
transformer-circuits.pub·18h·
Discuss: Hacker News
🧠LLM Inference
Flag this post
Cloud CISO Perspectives: AI as a strategic imperative to manage risk
cloud.google.com·6h
🛡️AI Safety
Flag this post
This Week in Security: Vibecoding, Router Banning, and Remote Dynamic Dependencies
hackaday.com·2h
🔓Hacking
Flag this post
Stop Making Your Team Figure Out AI on Their Own
nngroup.com·5h
🪄Prompt Engineering
Flag this post
Will AI Strengthen or Undermine Democracy?
schneier.com·11h
🛡️Anthropic PBC
Flag this post
Introducing Project Telos: Modeling, Measuring, and Intervening on Goal-directed Behavior in AI Systems
lesswrong.com·13h
🛡️AI Safety
Flag this post
AI coding is moving faster than the guardrails meant to secure it and that's risky business.
blog.codacy.com·9h·
Discuss: r/programming
🛡️AI Safety
Flag this post
Vulnerability in Claude enables data leak via prompt
techzine.eu·13h
🛡️Anthropic PBC
Flag this post
Show HN: GPU-accelerated sandboxes for running AI coding agents in parallel [video]
youtube.com·5h·
Discuss: Hacker News
🖥GPUs
Flag this post
🚀 New Launch: Become an AI Engineer | Learn by Doing | Cohort 2!
blog.bytebytego.com·7h
🆕New AI
Flag this post
When APIs Become Attack Paths: What the Q3 2025 ThreatStats Report Tells Us
lab.wallarm.com·11h·
Discuss: Hacker News
🕳LLM Vulnerabilities
Flag this post